Keyword Spotting Using Normalization of Posterior Probability Confidence Measures
نویسندگان
چکیده
Keyword Spotting Using Normalization of Posterior Probability Confidence Measures by Rachna Vijay Vargiya Thesis Advisor: Marius C. Silaghi, Ph.D. Keyword spotting techniques deal with recognition of predefined vocabulary keywords from a voice stream. This research uses HMM based keyword spotting algorithms for this purpose. The three most important componenets of a keyword detection system are confidence measure, pruning technique and evaluation of results. We suggest that best match for a keyword would be an alignment in which all constituent states have high emission probabilities. Therefore score of even the worst subsequence must also be better than a thershold and a path can be represented by the score of its worst subsequence match. This confidence measure is called Real Fitting. The harsher the pruning in a technique, the fewer paths survive. This increases the speed as well as the risk of pruning the best match. Three levels of pruning are explored and results and performance are compared. Since the proposed algorithms do no follow
منابع مشابه
Experimental Investigation into Alignment-based Acoustic Confidence Measures in Keyword Verification for Mandarin Speech
This paper introduces the methods to verify keyword hypothesis using alignment information, which can be easily implemented and integrated into the keyword spotting system in a straightforward way. Acquiring alignment information goes without any extra model training, and has no evident total processing time impact on original keyword spotting system. Different alignment-based confidence measur...
متن کاملCharacter confidence based on N-best list for keyword spotting in online Chinese handwritten documents
In keyword spotting from handwritten documents by text query, the word similarity is usually computed by combining character similarities, which are desired to approximate the logarithm of the character probabilities. In this paper, we propose to directly estimate the posterior probability (also called confidence) of candidate characters based on the N-best paths from the candidate segmentation...
متن کاملA Word-spotting Hypothesis Testing for Accepting/Rejecting Continuous Speech Recognition Output
The word rejection problem in speech recognition is formulated in a framework of word-spotting, where a spotted word is verified through a binary, acceptance/rejection decision. A generalized word posterior probability (GWPP), used as the sole confidence measure, is computed in a word graph, via the forward-backward algorithm or in an N-best list, using string likelihoods. The GWPP is further e...
متن کاملبهبود کارایی سیستم کاوشگر کلمات تلفنی با استفاده از نرمالیزاسیون امتیاز اطمینان مبتنی بر روش برنامهریزی خطی
Conventional word spotting systems determine hypothesized keywords and their confidence score using a speech recognizer. Acceptance or rejection of these keywords is intended based on comparison of their scores with a specific threshold. It has been proved that confidence score prepared by recognizer is highly dependent on sub-word structure of each keyword. So comparing assigned scores to keyw...
متن کاملSpotting Subsequences matching a HMM using the Average Observation Probability Criteria with application to Keyword Spotting
This paper addresses the problem of detecting keywords in unconstrained speech. The proposed algorithms search for the speech segment maximizing the average observation probability along the most likely path in the hypothesized keyword model. As known, this approach (sometimes referred to as sliding model method) requires a relaxation of the begin/endpoints of the Viterbi matching, as well as a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005